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Description 

This invention relates to methods and apparatus for encoding pictures, and can be advantageously employed for 
encodingpicturesby way of video data compression. ... 

Fig 1 shows the arrangement of a previously proposed device for encoding a moving picture by way of data com- 
oression With the picture encoding device shown in Fig.1 . digitized picture data of luminance components (Y). chroma 
SSonents (Cb) Li chroma components (Cr). with the numbers of pixels equal to 352(H) X 240(V) X 30 frames. 1 74 
f X 1 20m X 30 frames and 1 74(H) X 1 20(V) X 30 frames, respectively, are fed to an input terminal 1 . 

The input picture data, entering the input terminal 1 . is sent to a motion detector 20 and a block division unit 22 via 
a frame memory 10 configured for transiently storing the input picture data and re-arraying the picture data according 

^The bto^rd"i^teki unit 11 divides luminance components (Y) and chronfia components (Cr). (Cb) of each frame 
supplied from the frame merrrary 10 into 8X8 pixel blocks, as shown in Fig.3. The four blocks of the luminance compo- 
nents (YO, Y1 . Y2 and Y3). one chroma block (Cb) and one chroma block (Cr). totalling six bkjcks (YO. Y1. Y2. Y3. Cb 
and Cr). are termed a macro-block. 

The macro-bk)c-based data from the block divisfon unit 11 are sent to a subtractive unit 12. 
The subtractive unit 12 finds a difference between data from the bkxjk division unit 11 and inter-frame predictively 
coded picture data as later explained and sends the resulting difference to a fixed terminal b of a changeover switch 1 3 
as data for mter-frame predictive coding, as explained later. The data from the block divisfon unit 11 is supplied to the 
other fixed terminal a of the changeover switch 1 3 as data of a frame for intra-f rame coding, as explained later 

The bkwk-based data from the changeover switch 13 is transformed by DCT by a OCT circuit 14 from which the 
resulting DCT coefficients are sent to a quantizer 15. The quantizer 15 quantizes the DCT output with a pre-set quan- 
tizatkjn step width and the resulting quantized coefficients are sent to a zig-zag scan circuit 16. . ^. . ^ 

The zig-zag scan circuit 16 re-arrays the quantized coefficients according to zig-zag scan as shown in Fig.4 and 
sends the resultingoutput to a variable length encoding circuit 17. The variable length encoding circuit 17 vanable lerigth 
encodes output data of the zig-zag scan circuit 16 and sends the resulting output to an output buffer 18 while sending 
the informatton specifying the quantity of codes generated by variable length encoding to a quantization step controller 
1 9 The quantization step controller 1 9 controls the quantization step width of the quantizer 1 5 based upon the information 
specifying the quantity of codes from the variable length encoding circuit 17. Output data of the output buffer 18 is 
outputted as a compressed coded output at an output terminal 2. 

An output of the quantizer 15 is de-quantized by a de-quantizer 27 and inverse-transformed by an inverse DCT 
circuit 26 An output of the inverse DCT circuit 26 is sent to an addition unit 25. 

The addition unit 25 is also fed with inter-frame predictively coded picture data from a motion compensation unit 21 
viaachangeover switch 24 whk* is tumed on for a frame produced by inter-frame predictive coding. Thus the inter-frame 
predicted picture data is summed to the output data of the inverse DCT circuit 26. Output data of the addition unit 25 is 
terTHX)rarily stored in a frame memory 22 and thence supplied to a motion compensation unit 21. 

The motkjn compensation unit 21 effects motion compensation based upon the motk>n vector detected by themotion 
detection unit 20 and outputs the resulting inter-frame predictively-coded pteture data. 

An illustrative sequence of operatkms of the picture encoder shown in Fig.1 is explained in detail. For convenience 

in explanatbn. the f oltowing appellation is used for the respective frames. 

The frames arrayed in the display sequence are termed 10. B1 . B2. P3. B4. B5. P6. B7. 88. 19. B1 0. B1 1 . B12. .. 
Of these frames, I. P and B refer to the sorts of the methods for data compression, as later explained, and the numerals 
next to I , P and B simply indicate the display sequence. 

For compressing these ptetures, the MPEG 1 of the MPEG (Moving Picture Expert Group) which is a worit group 
for intemational standardization of the color moving picture encoding system, provides the following. 

First, the picture 10 is compressed. Next, the pteture P3 is compressed. At this time, it is not the picture P3 itself but 
difference data between P3 and 10 that is compressed. 

Next the picture B1 is compressed. At this time, it is not the picture B1 itself, but difference data between the pictures 
B1 and 10. difference data between pictures B1 and 10. between pictures B1 and P3 or between the picture B1 and mean 
values of the pictures 10 and P3. whichever is smaller in the information volume, that is compressed. 

Next the picture B2 is compressed. At this time, it is not the picture B2 itself, but difference data between the pictures 
B2 and 10. difference data between pictures B2 and P3 or a difference between the picture B2 and the mean values of 
the pictures 10 and P3, whichever is smaller in the information volume, that is compressed. 

Next, the picture P6 is compressed. At this time, it is not the picture P6 itself, but difference data between the pictures 
P6 and P3, that is compressed. 

The foltowing describes the above-described processing in the sequence in which it is executed. 
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20 In this manner, the encoding sequence is 10, P3. B1, B2, P6, B4, B5, P9. B7, B8, 19, PI 2, BIO, B11, ... and thus 

changed from the disp^y sequence. The compressed data, that is encoded data, are arrayed in this encoded sequence. 
The above is explained in further detail along with the operation of the picture encoding device shown in Fig.1. 
In encoding the first picture 10, data of a picture to be compressed first are outputted by the frame memory 10 and 
blocked by the bkxk dividing unit 11. The bkx:k dividing unit 11 outputs bkx:k-based data in the sequence of YO, Y1 , 
25 Y2, Y3. Cb aTKJ Cr. The bkx:k-based output data is routed via the changeover switch 1 3 set to the side ol the fixed 
terminal a to the DCT circuit 14. The DOT circuit 1 4 orthogonally transforms the bkxk-based data with two-dimensk)nal 
discrete cosine transform. This converts data from the time axis into that on the frequency axis. 

The DC! coeffk:ients from the DCT circuit 14 are sent to the quantizer 15 where it is quantized at a pre-set quan- 
tizatkxi step wkfth. The DCT coeffkaents are then re-arrayed in a zig-zag sequence by the zig-zag scan circuit 16 as 
30 shown in Fig.4. If the DCT coefficients are arrayed in a zig-zag sequence, the coefficient values are those of higher 
f requerK^y components towards the back so that the coefficient values become smaller towards the back. Thus, if the 
coefficient data are quantized at a certain value S, the probability of the result of quantization becoming zero becomes 
higher towards the back so that higher frequency components are removed. 

The quantized coefficients are then sent to the variable length coding (VLC) circuit 17 so as to be processed with 
^ Huffman coding, the resulting compressed bitstream is temporarily stored in the output buffer 18 and thence outputted 
at a pre-set bit rate. The output buffer 1 8 is a buffer memory for outputting an irregularly generated bitstream at a pre-set 
bit rate. 

The above-described compression of a sole picture is termed intra-f rame coding and the resulting picture is termed 
an I -picture. 

40 A decoder receiving the bitstream of the l-prcture performs an operatran which is the reverse of the above-described 

operatkNi for completing the first picture. 

The second picture, that is the picture P3, is encoded in the following manner. 

The second picture and ff. may be compressed as l-pictures to generate a bitstream. However, for improving the 
compresskjn ratra, the second picture ff. are compressed in the following manner for taking advantage of correlatran 
between the contents of the continuous pictures. 

First, the motran detection unit 20 finds, in the first picture 10. a pattern similar to each macro-block constituting the 
second prcture, and represents the pattern in terms of coordinates of relative positions (x, y) termed a motfon vector. 

With the second picture, the respective blocks are not directly sent to the DCT circuit 14, as in the case of the 
I -picture described above. That is. difference data between each block of the secorxl picture and a block referenced 
so from the first picture in accordance with the block-based motion v^tor, that is the difference data of the subtractive unit 
1 2. is sent to the DCT circuit 1 4. 

If the correlatk>n between the pattern of the first prcture represented by the motion vector and the pattern of the 
block now to be encoded is extremely strong, the difference data becomes extremely small, so that the anrraunt of 
compressed data becomes smaller when the motion vector and the difference data are encoded than when the block 
^ is ccxnpressed by the intra-f rame coding. 

The at>ove-described compression method is termed the inter-frame predictive coding. The difference data is not 
necessarily smaller and, depending upon the picture pattern, that is the contents of the picture, the compression ratio 
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becomes higher with intra-f rame coding than with coding the difference data. In such case, data is compressed by the 
Intra-frame coding. Which of the inter-frame predictive coding or the intra-f rame coding is to be employed differs from 
one macro-block to another. 

If the above is to be explained in connection with the picture encoder shown in Fig.1 , the picture which Is the same 

5 as the picture produced on the decoder side needs to be produced on the encoder side at all times if the inter-frame 
predictive coding Is to be achieved. 

To this end. there is provided in the encoder a circuit which Is the same as the decoder This circuit, termed a local 
decoder, includes the inverse DCT circuit 27, inverse DCT circuit 26. addition unit 25. frame memory 22 and the motion 
compensation unit 21 shown In Flg.1 . The picture stored In the frame memory 22 is termed a locally decoded picture or 

10 locally decoded data. The data of a picture not yet compressed is termed an original picture or original data. 

During compression of the first picture, that Is the l-picture 10. the first picture decoded by the local decoder is stored 
in the frame memory 22. Noteworthy is the fact that the picture produced by the local decoder is not the pre^ompression 
data but is the compressed and decoded picture and hence is the same picture as the picture which is to be decoded 
by the decoder and thus suffers from deterioration in the picture quality ascribable to compression. 

IS It is to the encoder under such condition that data of the second picture P3 (original data) is entered. The motion 

vector must have been pre-detected by this time. Such data has the motion vector from block to block. Such motion 
vector is supplied to the motion compensation unit 21. The motion compensatk>n unit 21 outputs data on the locally 
decoded partial picture specified by the motion vector (motion compensated data or MC data: one macro-block) as the 
inter-frame predicted pcture data. 

20 Pixel-based difference data from the subtractive unit 1 2 between the second original data and the motran compen- 

sated data (inter-frame predicted picture data) enters the DCT circuit 14. The method for compression since this time 
is basically the same as that for the l-picture. The picture compressed by the above-described compressing method is 
termed the forward predictive coded picture or P-pk;ture. 

More specifically, all macro-blocks of the P-picture are not necessarily compressed by the inter-frame predictive 

25 coding. If intra-frame coding is judged to give a hi^er coding efficiency for a given macro-block, the macro-block is 
encoded by intra-frame coding. 

That is, with the P-picture. the intra-frame coding or the inter-frame predictive coding is selected for compressk)n 
from one macro-block to arK>ther. The macro-block coded by intra-frame coding and that coded by inter-frame predictive 
coding are termed an intra-macro-bkx:k and an inter-macro-block, respectively. 

30 With the above local decoder, as described above, an output of the quantizer 1 5 is de-quantized by the de-quantizer 

27 and inverse orthogonal transfomned by the inverse DCT circuit 26. During encoding, the nrxjt ion -compensated data 
(MC data) is added to the output of the inverse DCT circuit 26 to give an ultimate k^cally decoded picture. 
The third picture.that is the picture B1, is encoded in the foltowing manner. 

For encoding the third picture (B1 picture), the mc^kxi vector for each of the pictures 10 and P3 is searched. The 
35 mcAfon vector for the picture 10 is termed the fonward vector MVf (x,y). while the motion vector for the picture P3 is temrted 
the backward vector MVb(x,y). 

For this third pk:ture, difference data is similarly compressed. It matters which data shouki be compressed. It is 
sufficient if the difference is taken with respect to a picture which will give the smallest amount of the infomr^ation by 
difference taking. There are four alternatives for the method for compression, namely 

40 

(1 ) employing the difference from data on the picture 10 indicated by the fonward vector MVf(x,y); 

(2) employing the difference from data on the picture P3 indicated by the backward vector MVb(x,y); 

(3) employing the difference between mean values of data on the picture 10 indicated by the fonward vector MVf 
(x.y) and data on the picture P3 indicated by the backward vector MVb(x,y); and 

45 (4) not employing the difference, that is encoding the third picture by intra-frame coding. 

One of these four methods for compression is selected on the macro-block basis. With the alternatives (1) to (3), 
the respective motk)n vectors are sent to the motion compensation unit 21 . The subtractive unit 1 2 takes the difference 
between the macro-block data and the motion-compensated data. The resulting difference is sent to the DCT circuit 1 4. 
so With the attemative (4). the data is directly sent to the DCT circuit 14. 

The above-described operation becomes feasible since two pictures, namely the pictures 10 and P3, have been 
restored by the encoding of the first and second pictures in the frame memory 22 configured for storing the kx:ally 
decoded picture. 

The fourth picture, that is the picture B2, is encoded in the following manner. 
55 The encoding of the fourth picture 82 is carried out in the same manner as for the encoding of the third picture B1 

except that "B1" in the previous description of the encoding method for the third picture B1 now reads "B2". 
The fifth picture, that is the picture PS, is encoded in the following manner. 

The encoding of the fifth picture P6 is carried out in the same manner as for the encoding of the second picture P3 
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except that "PS" and '10' in the previous description of the encoding nnethod for the second picture P3 now read "P6" 
and ■P2*'. respectively 

The encoding of the sixth picture ff . is not nnade since it is the repetition of the above-described encoding operations. 

The MPEG provides a group-of-picture (GOP). 
5 That is, a group of plural pictures is termed a groupof-pictures (GOP). The GOP must consist of continuous pictures 

in terms of the encoded data, that is the compressed data. On the other hand, the GOP takes account of random 
accessing and. to this end, the first picture in the GOP needs to be the l-picture. In addition, the last picture in the GOP 
in the display sequence must be an l-picture or a P-picture. 

Fig.5 shows an example in which the first GOP consists of four pictures and the second GOP ff. consist of six 
10 pictures. Figs.SA and 5B illustrate the display sequence and the sequence of the encoded data, respectively. 

If, in Fig.5, attention is directed to the GOP2, since the pictures B4 and B5 are formed from the pictures P3 and 16, 
the pictures B4 and B5 cannot be decoded correctly if the picture 16 is accessed by random access, since there is not 
the picture P3. The GOP which cannot be correctly decoded within itself is said to be not closed. 

Conversely, if the pictures B4 and B5 only refer to the picture 16, the pictures B4 and B5 can be decoded correctly 
IS if the pictures B4 and B5 access the picture 16 by random access, since the picture P3 is not required. The GOP which 
can be fully decoded from the information within itself is said to be closed. 

Data compression is done by the most efficient one of the above-described compressing methods. The quantity of 
the encoded data thus generated also depends on the input picture and can be known only after actual data compressk>n . 

However, it is also necessary to manage control for providing a constant bit rate of the compressed data The 
20 parameter for managing such control includes the quantization step or quantization scale (Q-scale) as the infornr^tk)n 
specifying the quantity of the codes applied to the quantizer 15. The larger or smaller the quantization step, the smaller 
and larger is the quantity of generated bits for the same compressbn method. 

The value of the c^antization step is controlled in the foltowing manner. 

For assuring a cor>stant bit rate of the compressed data, the output buffer 1 8 is provided at an output of the encoder. 
2S The output buffer operates for absorbing the difference in the difference in the amount of the generated data within a 
certain extent from picture to pk:ture. 

However, if data is generated at a rate surpassing a pre-set rate, the residual data quantity in the output buffer 18 
is increased thus producing data overflow. Conversely, if data is generated at a rate lower than the bit rate, the reskJual 
data quantity in the output buffer is decreased thus ultimately producing data underflow. 
30 Thus the encoder is configured for feeding back the reskJual data quantity in the output buffer 18 so that the quan- 

tizatwn step controller 19 controls the quantization step of the quantizer 15, in such a nrianner that, if the residual data 
quantity in the output buffer 1 8 becomes srrialler, the quantization step is controlled to be smaller to refrain from excessive 
compression and, if the residual data quantity of the output buffer 18 becomes larger, the quantizatksn step is controlled 
to be larger to raise the compressk>n ratb. 
3S On the other hand, there exists a considerable difference in the range erf the quantity of the encoded data generated 

by the above-described compressing method (the above-described intra-frame coding and inter-frame coding). 

If data compression is done by the intra-frame coding, a large quantity of data is produced, so that, if the vacant 
capacity of the output buffer 18 is small, the quantization step size needs to be increased. If the quantizatk>n step size 
is maximized, data overflow in the output buffer 18 may occasionally be produced. If the generated data can be stored 
40 jn the buffer 18, the intra-frame encoded pk:ture produced with a larger quantization step affects the picture quality of 
subsequently produced inter-frame predictively-coded pictures. In this consideration, a suffrcient vacant capacity of the 
output buffer 18 is required before proceeding to data compression by intra-frame coding. 

Thus the compression method of a pre-set sequence is provided, while the quantizatbn step controller 19 effects 
feedback control of the quantizatbn step size for assuring a sufficient vacant capacity of the output buffer 18 prior to 
^ the intra-frame coding. 

The above renders it possible to suppress the bit rate of the encoded data to a pre-set value. 

The intra-frame coded picture is of a larger data size after data compressbn. In general, the larger the number of 
the inter-frame predicted pbtures, the higher becomes the picture quality for a mean bit rate. 

However, in many cases, the intra-frame coded pictures are inserted at a fixed interval so as to be used as a 
so demarcation between GOPs in view of random accessibility. 

On the other hand, with the bi-directionally coded pictures, among the inter-frame predictbn coded pbtures, the 
data size becomes significantly smaller On the other hand, a larger number of the inter-frame predictbn coded pictures, 
in particular the B-pictures, may be introduced for a single intra-frame coded picture, the mean bit rate via the output 
buffer nnay be maintained at a lower value while maintaining a moderate picture quality. 
ss However, with a pfcture exhibiting extremely low correlation with the directly prevbus picture, such as a picture 

directly after a scene change, efficient data compression cannot be achieved with the inter-frame predictive coding. 
Rather, the intra-frame coded picture gives higher compression efficiency. For such picture, the intra-frame coding is 
preferred. 



5 




EP 0 705 041 A2 



However in such case, a picture different from the intra-frame coded picture used for demarcating the GOP is 
inserted as an intra-frame coded picture, resulting in a higher rate of the intra-frame coded pictures to the inter-frame 
predictive coded pictures In the GOP. Thus, if the bit rate of the output data is to be suppressed to below a pre-set value, 
the picture quality is necessarily lowered. 

5 In addition, it is inconvenient if, for demarcating the GOPs for random accessing, scene transitions are not coincident 

with the GOP junction points. 

On the other hand, it is a rare occurrence that, if a sequence is encoded so that it is terminated at a pre-set frame, 
the last picture represents the junction point of the GOP to the next GOP There is also a risk that the last GOP be of 
short duration to impair the picture quality at the ultimate end. 

10 Although it is possible to encode data and feed back the results for re-partitioning the GOPs, this gives only a poor 

encoding efficiency while being unfit for a real-time encoder. 

In addition, there is imposed a limitation that, while the last picture is not necessarily the l-picture or the P-plcture, 
the GOP must be tenrninated by the l-picture or the P-picture. However, re-encoding leads to poor encoding efficiency, 
while encoding of all combinations leads to an increased scale of the apparatus. 

IS One aspect of the present invention provides a picture encoding apparatus comprising pk;ture data storage means 

for storing a plurality of input picture data, scene change detection means for evaluating the quantity of the Information 
of the input pk:ture data from the plural picture data stored in the picture data storage means for detecting a scene 
change, selection means for selecting a GOP of a preset unit length made up of a picture fomied by compression by 
intra-frame coding and a picture fomied by compression by inter-frame predictive coding t>ased upon a scene change 

20 detection output supplied from sakJ scene change detect bn means and compression encoding means for compression 
encoding the input picture data in a pre-set manner in accordance with the GOP selected by the selectbn means. 
Another aspect of the present invention is set forth in claim 5. 

Embodiments of the present invention provide a picture erKXxling apparatus whereby picture compression may be 
achieved with high efficiency to lead to improved picture quality. 
2S Preferably, the selection means is responsive to a scene detectk>n output of the scene change detection means for 

adaptively changing the number of prctures for intra-frame coding and the number of pictures for inter-frame predictive 
coding in the GOP for determining the GOP. Preferably, the selection means changes a given inter-frame predictively 
coded picture to an other inter-frame predictively coded pkiture in the GOP based upon a output of the unit length control 
means. 

30 Preferably, the selection means sets the GOP responsive to the scene change detection output only when the raX\o 

of the number of the pictures for inter-frame predictive coding to the number of pictures for intra-frame predictive coding 
exceeds a pre-set ratkj. In other words, if the ratio of the number of the pictures for inter-frame predictive coding to the 
number of pictures for intra-frame predictive coding is smaller than the pre-set ratio, the length of the pre-set unit Is 
protracted without being coincided with the scene change detection output. 

35 In embodiments of the present invention, the quantity of the information of input picture data is evaluated for detecting 

a scene change and the intra-frame coding Is selected as a compresskjn method responsive to scene change detection 
for raising the compressbn efficiency at a scene change portion of the input picture data where the previous and suc- 
ceeding pk:tures are correlated with each other to a lesser extent. 

In addition, in embodiments of the present Invention, the length of the pre-set unit is preferably changed responsive 

40 to scene change detection for coinciding the scene change with the boundary between the pre-set units. 

Furthermore, if the ratio of the number of the pictures for inter-frame predictive coding to the number of pictures for 
intra-frame predictive coding is smaller than the pre-set ratio, the length of the pre-set unit is protracted without being 
coincided with the scene change detection output, thereby suppressing the bit rate of the output data to a pre-set value 
for maintaining the picture quality. 

45 Embodiments of the invention will now be described, by way of example, with reference to the accompanying draw- 

ings in which: 

Fig. 1 is a schematic block circuit diagram showing an arrangement of a previously proposed picture encoding device. 
Fig.2 illustrates the construction and resolution of a picture. 
Fig.3 illustrates blocks and a macro-block. 
so Fig.4 illustrates zig-zag scan. 

Figs.SA and SB illustrate a typical GOP in the display sequence and In the encoding sequence, respectively. 
Fig.6 is a schematic block circuit diagram of a picture encoding device embodying the present invention. 
Fig.7 is a flow chart for illustcating the process of setting a GOP 

Fig.8 is a fiow chart for illustrating the process of setting picture types for the GOP thus set. 
ss Figs.9A to 9D illustrate typical changes to the picture type at the GOP end. 

Fig.6 shows a schematic arrangement of a picture encoding apparatus (encoder) embodying the present invention. 
In Fig.6, the parts or components whk:h are the same as those shown in Fig.1 are depicted by the same numerals and 
the corresponding description is omitted for clarity. 
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In the arrangement of Fig.6, the components added to the arrangement of Fig. 1 are a scene change detection circuit 
31 , a GOP length control circuit 34 and a compression method selection circuit 32. Af rame memory 40 and a quantization 
step controller are modified from the corresponding components of Fig.1 . 

Thus the picture encoder of the present embodiment includes the frame memory 40. as picture data storage means 
s for storing plural frames of input picture data, and the scene change detection circuit 31 for evaluating the arrwDunt of 
the information of the input picture data from the picture data of plural frames stored in the frame menrK>ry 40 for detecting 
scene changes. The encoder also includes selection means for selecting the compression methods for selection by 
intra-frame coding based upon the supply of a scene change detection output from the scene change detection circuit 
31 , and components from a subtractive unit 12 to a variable length encoding circuit 17, as compression encoding means 
10 for compressing the input picture data in accordance with the compression method as selected by the selection means. 
The encoder also includes a local decoder and motion detection unit 20 and a nrration compensation unit 21. 

The selection means includes a GOP length control circuit 34 as unit length control means for adaptrvely changing 
the number of intra-frame coded pictures and the number of inter-frame predictively coded pictures in a pre-set unit of 
pictures made up of plural pictures (GOP) responsive to a scene change detection output from the scene change de- 
is tectlon output 31 for setting the GOP length, and a compression method selection circuit 32 for selecting the compression 
methocte for respective pictures in the GOP based upon an output of the GOP length control circuit 34. 

The selection means sets the GOP length responsive to the scene change detection output only when the ratio of 
the number of the inter-frame predictively coded pictures to the number of the intra-frame coded pictures surpasses a 
pre-set value. In other words, if the ratio of the number of the inter-frame predictively coded pictures to the number of 
20 the intra-frame coded pictures is lower than the pre-set value, the GOP length is not of a length corresponding to the 
scene change detection output, but is protracted, by way of an example. 

Fief erring to Fig.6, input picture data entering an input terminal 1 is stored in the frame memory 40. The frame 
memory 40 differs from the frame memory 10 of Fig.1 in that it is capable of storing a pre-set number of frames. The 
pre-set number of frames needs to be targer than the number of pictures corresponding to the GOP length when the 
2S number erf inter-frame predictive coded pictures contained in the GOP is sufficient to assure a nrvxierate picture quality 
for a case in which only one intra-frame coded picture is contained in the GOP. In other words, the number of frames 
stored in the frame mentKjry 40 needs to be larger than the number of the inter-frame predictrvely-coded pictures for one 
intra-frarr»e coded pictures which is sufficient to assure a rrKxierate picture quality. That is, the nunrU^er of frames stored 
in the frame memory 40 needs to be such a number as to assure that, if a picture being encoded is encoded by intra-frame 
30 coding, intra-frame coding \s rrot done until a number of inter-frame predictive coded pictures sufficient to eke out or 
caru^el the quantity of data produced by such intra-frame coding may be continuously sent out from the frame memory 40. 

Of course, the pre-set number is rxA limited to that defined above and may be of a larger value for confomning to a 
variety of operating conditions. For example, if the ertcoder is of the type in which rate control is done after pre-setting 
the GOP. the frame memory 40 needs to be of a capacity sufficient to store pictures until setting the GOP length and 
35 starting the encoding. 

The picture data stored in the frame memory 40 are supplied to the scene change detection circuit 31 . 
A scene change may be said to t>e a point in the picture data where correlation between preceding and succeeding 
pictures is impaired significantly- In the present embodiment, the sum of difference values between the current picture 
and the picture following motion vector compensation is found for the entire picture and compared to the corresponding 
40 difference values for the directly previous picture for detecting a scene change. 

For example, for detecting a scene change, roughly two parameters are calculating by way of evaluating the picture 
information. 

The first parameter specifies the amount of the information for enabling the post-compression quantity of data in 
case compression is done by intra-frame coding. The first parameter may be the sum or statistic values of DCT coeffi- 
45 cients obtained by block-based DCT of picture data supplied from the frame memory 40. If the parameter found in this 
manner is of an excessive scale, it may also be the block-based sums of mean square errors of the DCT coefficients. 
In sum, the parameter which represents the quantity of the pictu re information and which is sufficient to infer the post-com- 
pressKjn data quantity is empk>yed as the first parameter. 

The second parameter specifies the quantity of the differential picture information which enables the post-compres- 
50 sbn data quantity to be predicted in case the compression is done by the inter-frame predictive coding. The parameter 
may for example be the in-block sum of the difference values between the picture stored in the frame memory 40 and 
the motion-compensated picture. For calculating the parameter, the least error for detecting the motbn vector, obtained 
in a common motion vector detection circuit (motion detection unit 20) may be employed. 

The GOP length control circuit 34 then controls the GOP length, based upon the scene change detection output 
55 from the scene change detection circuit 31 , as explained later by referring to the flow charts of Figs.7 and 8. 

The GOP length information, as set by the GOP length control circuit 34, is sent to the compression method selection 
circuit 32. 

It is noted that an intra-frame coded picture must be present at least at an initial portion of the GOP. Since the GOP 
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is of a certain length in viewof random accessibility, intra-frame coded pictures occur periodically o"" with scerie changes^ 
Thus the compression method selection circuit 32 is fed not only with the infomiation spec.fy.ng the GOP length frorn 
the GOP lengtti control circuit 34 but also with the picture counts and an output of the scene change detection crcurt 
31 transm'rtted via the GOP length control circuit 34. ■ 

s The compression method selection circuit 32 selects, based upon an output of the scene change detection c.rcu.t 

31 whichoftheintra^ramecodingortheinter-framepredictive-codingshouldbeemployedJnadditiont^ 
method selection circuit 32 introduces the intra-frame coded pictures based upon the GOP length information. 

On the other hand, the quantization step controller 33 controls the quantization step of the quantizer 1 5 based upon 
the infonnation specifying the code quantity from the variable length encoding circuit 17 and the infomnation on the 

10 compression system as selected by the compression method selection circuit 32. „h t«,«cmitc th« 

The quantizer 15 quantizes the OCT output with the abovementioned quantization step width and transmits the 
quantized coefficients to the zig-zag scan circuit 16. ^, „ ,u„ f« tho 

The zig-zag scan circuit 1 6 re-arrays the quantized coefficients by zig-zag scan and transmits the output to he 
variable length encoding circuit 17. The variable length encoding circuit 17 variable length encodes output data of the 

IS ziq-zag scan circuit 1 6 and transmits the resulting output to the buffer 1 8. while transmitting the infomrjation the code 
quantSr as generated by variable length coding to the quantization step controller 33. On the other hand, output data 
from the output buffer 18 is outputted at an output terminal 2 as compressed encoded output data. The encoded output 
data is written by a recording device 35 on a recording medium 36. Although not shown, the recording device 35 includes 
an error correction code appending circuit and a modulating circuit. The recording medium 36 is sp^rf.cally an optK:al 

20 disc which is a master disc for mass-producing ROM discs similar to compact discs. Aftematively. rt is a wntable d.sc^ 
The processing flow in the constmction of the present embodiment is explained by refemng to the flow charts of 

""'^Befort description of the flow charts of Figs.7 and 8. the following definition is given of constants and variables used 

25 *^^e usualtength of a GOP is a constant (DefaultLen). The length of the GOP when a number of the friter-f rame 
predictive-coded pictures sufficient to assure suitable picture quality when there is only one intra-frame coded p^ture 
in the GOP is a variable (MinLen). The shortest boundary demarcating the GOPs is a variable (l^inGOP). 
The longest GOP value for assuring random accessing is a variable (MaxLen). 
The target value of the next GOP demarcation is a variable (TargetLen). 
30 The length from the leading end of the GOP is a variable (CurLen). the picture number in the coding sequence is 

a variable (PicLen). the variable holding the leading picture number in the coding sequence is a variable (GOPbegin 
[n]), and the variable holding the trailing picture number in the coding sequence is a varable (GOPend[n]). 
Although not shown in the flow chart, the following conditions are presupposed. 

In a previous stage, input pictures are sequential^ stored in the frame memory 40 and a scene change is detected. 
35 The objective of scene detection is to detect a picture which cannot be detected efficiently by inter-frame predctive 
coding as described previously. Thus the scene change may be detected by detecting the result of the drfference value 
after motion compensation being rapidly increased as compared to the directly previous difference value. 

For adaptively demarcating the GOPs. pictures for intra-frame coding are not set at the outset, and all pictures are 
motion-detected as P-piclures or B-pictures. depending on M. The pictures are assumed to be arrayed in the coding 
40 sequence at the time of setting of the GOPs. 

The flow chart of Fig.7 is first explained. „ -r, oki t -»i 

At step SI 51 . the variables are first reset so that n = 0. Curt.en = 1 , PicNo = 0. GOPbegin[nl = Pk:No. TargetLen = 
DefaultLen and MinGOP = MinLen. ..^ •. 

At the next step S152, it is checked whether or not the picture is a first P-picture in the GOP. If the picture is found 
45 at step SI 52 to be not the first P-picture. the program transfers to step SI 57 where the picture type is changed from 
the P-picture to the l-picture. If the picture is found at step SI 52 to be the first P-picture. the program transfers to step 
S153 

It is checked at step 81 53 whether or not the current pwture is the last picture in the sequence. If the current picture 
is found at step SI 53 to be the last picture in the sequence, the program transfers to step S154 where the picture is set 
so to be the last picture of the GOR If the current picture is found at step 81 53 to be other than the last picture, the program 

transfers to step SI 58. ^ ^. . ^ • o i< 

At step 81 55 next to step 81 54, it is checked whether or not the last picture in the display order is the P-picture. If 
the last picture in the display order is found at step 81 55 to be other than the P-picture. that is that the last picture is the 
B-picture the program transfers to step 8156. Since the sequence cannot be terminated by the B-picture. the picture 
S5 is changed at step 8156 to a P-plcture. If the last picture is found at step s155 to be the P-picture. the processing is 

termin^^^^ SI 58, to which the program transfers when the current picture is found not to be the last picture, it Is checked 
whether or not the scene change has occurred. If a scene change has been found to have occurred at step S158. the 
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program transfers to step SI 59. If otherwise, that is if no scene change has been found to have occurred, the program 
transfers to step SI 64. 

At step 8159, to which the program transfers when the scene change is found to have occurred at step 3158, the 
variable (CurLen) is compared to the variable (MaxLen). If it is found that CurLen < MaxLen, the program transfers to 
5 step SI 60 and. if otherwise, the program transfers to step SI 63. 

At step SI 60, the variable (CurLen) is compared o the variable (MinGOP). It is found that CurLen < MInGOP, the 
program transfers to step SI 61 and, if otherwise, the program transfers to step SI 63. 

At step SI 61, since the GOP is of a short length and the quantity of data generated with the l-picture cannot be 
distributed, the first picture after a scene change is changed to the intra-f name coded picture. For taking account of data 
10 for the l-picture, newly supplemented for scene change, MinGOP += MinLen and TargetLen += MinLen are set. The 
program then transfers to step S162. 

If, at step SI 63, the result of comparison between the variable (CurLen) and the variable (MaxLen) specifies CurLen 
^ MaxLen, the GOP is divided, taking advantage of the scene change, in order to avoid the GOP becoming too long. 
The program then transfers to step SI 62. 
IS At step S164, to which the program transfers when no scene change is deemed to have occurred at step SI 58. the 

variable (CurLen) is compared to variable (Targetlen). If, at step SI 64, it is judged that CurLen = TargetLen, the program 
transfers to step SI 66. 

At step SI 66. it is checked whether or not the picture type of the next picture is the B-piclure. If the next pfcture is 
found to be the B-picture, the program transfers to step SI 67. Since the sequence cannot be terminated with the B-pk:- 
20 ture. the target GOP length is elongated by one picture. The program then transfers to step SI 62. 

If it is found at step SI 64 that CurLen is not equal to Targetlen. the program transfers to step SI 65 where the variable 
(CurLen) is compared to the variable (TargetLen) + the variable (MinLen). If, at step SI 65. it is found that Curi-en ^ 
TargetLen + MinLen, the program transfers to step S168. If it is found at step S165 that CurLen > Targetlen + MinLen, 
the program transfers to step SI 62. 
25 If the GOP is divided at the target GOP length, there is no scene change within a length of pk:ture data sufficient to 

distribute data generated at the leading l-prcture of the next GOP. Thus the GOP is drvkJed at step SI 68 at the variable 
(TargetLen). After step si 68. the program transfers to step SI 62. 

At step Si 62, the variables (CurLen) and (PrcLen) are counted up. The program then reverts to step SI 52. 

In the f!ow chart shown in Fig.8, the GOP length is set at step SI 69 to a specified value. The variables are initelized 
30 for setting the length of the next GOP. At step SI 70, it is checked whether or not the last pfeture in the GOP in the display 
order is the P-picture. If the last picture is found at step S170 not to be the P-pk;ture, that is. if the last picture is found 
to be the B-pk:ture, the program transfers to step SI 71 . Since the GOP cannot be terminated with the B-pwture. the last 
picture is changed at step S1 71 to the P-picture. If the last picture is found at step SI 70 to be the P-pkrture, the processing 
is terminated. 

3S In the now charts of Figs.7 and 8. the foltowing methods may be used for changing the picture type of the last picture 

of the GOP. 

That is. the last pfcture in the GOP must be an l-picture or a P-prcture. If the GOP is partitioned at e.g., a scene 
change, this is not necessarily the case with the sequence with M = 2 or more. Thus, if the last pbture of the GOP is 
the B-picture. the last B-picture is changed to a P-picture. 

40 By way of an example, it is assumed that the bitstream order is as shown in Fig.9C, with the display order being as 

shown in Fig.9A. and that a scene change has occurred between the pictures B1 3 and P14 in the display order of Fig.9A. 
In such case, the picture B1 3 immediately before the scene change in Fig.9A is changed to the P-picture. such as PI 3, 
as shown in Fig.9B. Similarly, the picture PI 4 immediately after the scene change in Fig.9A is changed to the l-picture, 
such as 114. as shown in Fog.9B. 

4S Since it is difficult to execute encoding a second time in the case of a real-time encoder, when the picture B1 3 is 

changed to the picture PI 3, as shown in Figs.9A and 9B, the B-picture is encoded as a p-picture using only the forward 
vector from the picture P1 1 . TTie picture B1 2 is encoded using only the fonward vector from the pk:ture P1 1 . 

With the picture encoder of the illustrated embodiment, a scene change is detected by evaluating the quantity of 
the informatbn of input picture data and intra-f rame coding is employed as a compression method for a picture corre- 

50 spending to the scene change, thereby raising the compression efficiency at the scene change where the picture cor- 
relation is lowered. 

With the embodiment illustrated, the GOP length is changed responsive to detection of scene change thereby en- 
abling the scene change to be coincident with the GOP boundary point. 

If the ratio of the number of inter-frame predictive-coded pictures to the number of intra-franne coded pictures is 
55 smaller than a pe-set ratio, the GOP length is elongated without being a length corresponding to the scene change 
detection output, thereby suppressing the output data to a pre-set bit rate for maintaining the picture quality. 

With the picture encoding device of the present embodiment, the quantity of the infornnation of the input picture data 
is evaluated for detecting the scene change and the intra-frame coding is used as a compression method for a picture 
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corresponding to the detection of the scene change, thereby raising the compression efficiency at the scene change 
where the correlation between previous and succeeding pictures is lowered, thus assuring highly efficient picture com- 
pressbn and improved picture quality. 

In addition, the length of the pre-set unit is changed responsive to detection of the scene change for enabling the 
scene change to be coincident with the boundary points between the pre-set units. 

Furthermore if the ratio of the number of inter-frame predictive^oded pictures to the number of intra-frame coded 
pictures is smaller than a pre-set value, the GOP length is elongated without being a length corresponding to the scene 
change detection output, thereby suppressing the output data to a pre-set bit rate without lowering the picture quality. 



Claims 

1. A picture encoding apparatus comprising: 

picture data storage means for storing a plurality of input picture data; 

scene change detection means for evaluating the quantity of the information of the input picture data from the 
plural picture data stored in said picture data storage means for detecting a scene change; 
selection means for selecting a GOP of a preset unit length nr^ade up of a picture fomned by compression by 
intra-frame coding and a picture formed by compression by inter-frame predictive coding based upon a scene 
change detection output supplied from said scene change detection means; and 

compression encoding means for compression encoding said input picture data in a pre-set manner in accord- 
ance wrth the GOP selected by said selection means. 

2. The picture encoding apparatus as claimed in claim 1, wherein said selection means is responsive to a scene 
detection output of said scene change detection means for adaptively changing the number of pictures for mtra-f rame 
coding and the number of pictures for inter-frame predictive coding in said GOP for determining the GOP. said 
selection means changing a given inter-frame predictively coded picture to an other inter-frame predictivety coded 
picture in the GOP based upon a output of said unit length control means. 

3 The picture encoding apparatus as claimed in claim 2, wherein said given inter-frame predictively coded picture in 
the GOP is a bi-directionally predictively coded picture (B-picture) and said other inter-frame predictively coded 
picture is a fonwardly predictively coded pkiture (P-picture). 

4 The picture encoding apparatus as claimed in claim 2. wherein said selection means sets the GOP responsive to 
the scene change detection output only when the ratb of the number of the pictures for inter-frame predictive coding 
to the number of pictures for intra-frame predictive coding exceeds a pre-set ratio. 



5. A picture encoding method comprising the steps of: 



storing a plurality of input picture data; 

evaluating the quantity of the information of the input picture data from the plural stored picture data; 
selecting a GOP of a preset unit length made up of a picture formed by compression by intra-frame coding and 
a picture formed by compression by inter-frame predictive coding based upon a scene change detection output; 

and r^r\D 
compression encoding said input picture data in a pre-set manner in accordance with the selected GOP. 

6 The picture encoding method as claimed in claim 5. wherein said step of selecting the GOP adaptively changes the 
number of pictures for intra-frame coding and the number of pictures for inter-frame predictive coding in said GOP 
responsive to a scene detection output of said scene change detection means for setting the GOP and changes a 
given inter-frame predictively coded picture to an other inter-frame predictively coded picture in the GOP based 
upon a output of said unit length control means. 

7. The picture encoding method as claimed in claim 6. wherein said given inter-frame predictively coded picture in the 
GOP is a bi-directionally predictively coded picture (B-picture) and said other inter-frame predictively coded picture 
is a forwardly predictively coded picture (P-p*icture), 

a The picture encoding method as claimed in claim 6, wherein said GOP selection step sets the GOP responsive to 
the scene change detection output only when the ratio of the number of the pictures for inter-frame predictive coding 
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to the number of pictures for intra-frame predictive coding exceeds a pre-set ratio. 

9. A recording mediunr) having recorded thereon encoded signals formed by the picture encoding method as claimed 
in any one of claims 6 to 8. 
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